Optimal Set Recommendations Based on Regret

Authors

  • Paolo Viappiani
  • Craig Boutilier
Abstract

Current conversational recommender systems do not offer guarantees on the quality of their recommendations, either because they do not maintain a model of a user’s utility function, or do so in an ad hoc fashion. In this paper, we propose an approach to recommender systems that incorporates explicit utility models into the recommendation process in a decision-theoretically sound fashion. The system maintains explicit constraints on the user’s utility based on the semantics of the preferences revealed by the user’s actions. In particular, we propose and investigate a new decision criterion, setwise maximum regret, for constructing optimal recommendation sets. This new criterion extends to sets the notion of maximum regret used in decision theory and preference elicitation. We develop computational procedures for computing setwise max regret. We also show that the criterion suggests choice sets for queries that are myopically optimal: that is, it refines knowledge of a user’s utility function in a way that reduces max regret more quickly than any other choice set. Thus setwise max regret acts both as a guarantee on the quality of our recommendations and as a driver for further utility elicitation. Our simulation results suggest that this utility-theoretically sound approach to user modeling allows much more effective navigation of a product space than traditional approaches based on, for example, heuristic utility models and product similarity measures.
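
To make the criterion concrete, the following is a minimal sketch of setwise max regret, not the paper's own computational procedure. It assumes linear utilities over feature vectors and replaces the constrained utility space with a small finite list of candidate weight vectors; the names `utility`, `setwise_max_regret`, and `minimax_regret_set`, and the brute-force search over subsets, are hypothetical choices made for illustration. The regret of a set is the worst case, over the candidate utilities, of the gap between the best item overall and the best item in the set.

```python
from itertools import combinations


def utility(item, w):
    """Linear utility (an assumption): dot product of features and weights."""
    return sum(f * wi for f, wi in zip(item, w))


def setwise_max_regret(rec_set, items, weight_samples):
    """Worst-case loss of the best item in rec_set versus the best item overall.

    weight_samples is a finite stand-in (an assumption) for the set of
    utility functions still consistent with the user's revealed preferences.
    """
    worst = float("-inf")
    for w in weight_samples:
        best_overall = max(utility(x, w) for x in items)
        best_in_set = max(utility(z, w) for z in rec_set)
        worst = max(worst, best_overall - best_in_set)
    return worst


def minimax_regret_set(items, weight_samples, k):
    """Brute-force search for a size-k set with minimal setwise max regret."""
    return min(
        combinations(items, k),
        key=lambda s: setwise_max_regret(s, items, weight_samples),
    )


# Toy data: two extreme products and one compromise product.
items = [(1.0, 0.0), (0.0, 1.0), (0.6, 0.6)]
weights = [(1.0, 0.0), (0.0, 1.0), (0.5, 0.5)]

best_single = minimax_regret_set(items, weights, 1)
best_pair = minimax_regret_set(items, weights, 2)
```

In this toy instance the best single recommendation is the compromise product, while the best pair consists of the two extreme products: a set can cover several possible user types at once, which is the effect the setwise criterion is meant to capture.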

Similar resources

Recommendation Sets and Choice Queries: There Is No Exploration/Exploitation Tradeoff!

Utility elicitation is an important component of many applications, such as decision support systems and recommender systems. Such systems query users about their preferences and offer recommendations based on the system’s belief about the user’s utility function. We analyze the connection between the problem of generating optimal recommendation sets and the problem of generating optimal choice...

Computing optimal k-regret minimizing sets with top-k depth contours

Regret minimizing sets are a very recent approach to representing a dataset D with a small subset S of representative tuples. The set S is chosen such that executing any top-1 query on S rather than D is minimally perceptible to any user. To discover an optimal regret minimizing set of a predetermined cardinality is conjectured to be a hard problem. In this paper, we generalize the problem to t...

Regret optimality in semi-Markov decision processes with an absorbing set

The optimization problem for the general utility case is considered for countable-state semi-Markov decision processes. The regret-utility function is introduced as a function of two variables: a target value and a present value. We consider the expectation of the regret-utility function incurred until the time of reaching a given absorbing set. In order to characterize the regret...

Towards Minimax Policies for Online Linear Optimization with Bandit Feedback

We address the online linear optimization problem with bandit feedback. Our contribution is twofold. First, we provide an algorithm (based on exponential weights) with a regret of order √(dn log N) for any finite action set with N actions, under the assumption that the instantaneous loss is bounded by 1. This shaves off an extraneous √d factor compared to previous works, and gives a regret bound...

A Geometric Traversal Algorithm for Reward-Uncertain MDPs

Markov decision processes (MDPs) are widely used in modeling decision making problems in stochastic environments. However, precise specification of the reward functions in MDPs is often very difficult. Recent approaches have focused on computing an optimal policy based on the minimax regret criterion for obtaining a robust policy under uncertainty in the reward function. One of the core tasks i...

Publication date: 2009